Search CORE

7 research outputs found

EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020

Author: Agerri Rodrigo
Aliprandi Carlo
Alkhalifa Rabab
Alzetta Chiara
Angel Jason
Anselmi Guido
Appiah Balaji Nitin Nikamanth
Aroyehun Segun Taofeek
Artigas Herold Maria Fernanda
Attanasio Giuseppe
Attardi Giuseppe
Badryzlova Yulia
Bai Yang
Baldissin Gioia
Ballarè Silvia
Barrón-Cedeño Alberto
Bartle Anna-Sophie
Basile Pierpaolo
Basile Valerio
Basili Roberto
Belotti Federico
Bennici Mauro
Bharathi B.
Bhuvana J.
Bianchi Federico
Bisconti Elia
Bolanos Luis
Bondielli Alessandro
Bosco Cristina
Breazzano Claudia
Brivio Matteo
Brunato Dominique
Cafagna Michele
Caputo Annalina
Caselli Tommaso
Cassotti Pierluigi
Castañeda Enrique
Castro Castro Daniel
Centeno Roberto
Cercel Dumitru-Clementin
Cerruti Massimo
Chandrabose Aravindan
Chesi Cristiano
Chiarello Filippo
Cignarella Alessandra Teresa
Cimino Andrea
Comandini Gloria
Croce Danilo
Dai Hongbing
Dascalu Mihai
Dell’Orletta Felice
Delmonte Rodolfo
Deng Tao
De Francesco Nazareno
De Martino Graziella
De Mattei Lorenzo
Di Buccio Emanuele
Di Maro Maria
di Nuovo Elisa
Di Rosa Emanuele
dos S.R. da Silva Adriano
Durante Alberto
El Abassi Samer
Espinosa María S.
Fabrizi Samuel
Fantoni Gualtiero
Ferilli Stefano
Ferraccioli Federico
Fersini Elisabetta
Finos Livio
Fiorucci Stefano
Fontana Michele
Frenda Simona
Gambino Giuseppe
Gatt Albert
Gelbukh Alexander
Giorgi Giulia
Giorgioni Simone
Girardi Paolo
Goria Eugenio
Gregori Lorenzo
Hoffmann Julia
Iacono Maria
Iovine Andrea
Izzi Giovanni Luca
Jimenez Sergio
Kaiser Jens
Kayalvizhi S.
Kivlichan Ian
Klaus Svea
Koceva Frosina
Kovács György
Kruschwitz Udo
Labadie Tamayo Roberto
Lai Mirko
Laicher Severin
Lapesa Gabriella
Lavergne Eric
Lebani Gianluca E.
Lebani Gianluca E.
Lees Alyssa
Lenci Alessandro
Leonardelli Elisa
Li Hongling
Liakata Maria
Lovetere Marco
Madonna Domenico
Massidda Riccardo
Mattei Lorenzo De
Mauri Caterina
Mele Francesco
Melucci Massimo
Menini Stefano
Miaschi Alessio
Miliani Martina
Moggio Alessio
Montagnani Matteo
Montefinese Maria
Montemagni Simonetta
Monti Johanna
Moraca Maurizio
Moretti Giovanni
Morra Simone
Murphy Killian
Muti Arianna
Nakov Preslav
Nisioi Sergiu
Nissim Malvina
Nozza Debora
Occhipinti Daniela
Ortega Bueno Reynier
Ou Xiaozhi
Palmonari Matteo
Parizzi Andrea
Pascucci Antonio
Passaro Lucia C.
Pastor Eliana
Patti Viviana
Pirrone Roberto
Polignano Marco
Politi Marcello
Pont Mattia Da
Pražák Ondřej
Proisl Thomas
Puccetti Giovanni
Přibáň Pavel
Radicioni Daniele P.
Rama Ilir
Rambelli Giulia
Ravelli Andrea Amelio
Rodrigo Alvaro
Rodriguez-Diaz Carlos A.
Rodriguez Cisnero Mariano Jason
Roman Norton T.
Roman Norton Trevisan
Rossmann Daniela
Rosso Paolo
Rotaru Armand Stefan
Rubino Edoardo
Russo Irene
Sabella Gianluca
Saini Rajkumar
Salman Samir
Sangati Federico
Sanguinetti Manuela
Sarti Gabriele
Schlechtweg Dominik
Schulte im Walde Sabine
Sciandra Andrea
Setpal Jinen
Siciliani Lucia
Solari Dario
Sorensen Jeffrey
Sorgente Antonio
Sprugnoli Rachele
Stranisci Marco
Tamburini Fabio
Taylor Stephen
Tesei Andrea
Thenmozhi D.
Tonelli Sara
Torre Ilaria
Tsakalidis Adam
Varvara Rossella
Venturi Giulia
Vettigli Giuseppe
Vlad George-Alexandru
Wang Benyou
Zaharia George-Eduard
Zamparelli Roberto
Zubiaga Arkaitz
Publication venue: 'OpenEdition'
Publication date: 11/05/2021
Field of study

Welcome to EVALITA 2020! EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it) and it is endorsed by the Italian Association for Artificial Intelligence (AIxIA, http://www.aixia.it) and the Italian Association for Speech Sciences (AISV, http://www.aisv.it)

OpenEdition

Die Kookkurrenz sprachlicher Strukturen

Author: Proisl Thomas
Publication venue
Publication date: 01/01/2019
Field of study

The study of cooccurrences, i. e. the analysis of linguistic units that occur together, has had a profound impact on our view of language. Not only has it contributed greatly to the insight that semi-preconstructed phrases and item-specific knowledge are central to how language works, but it has also led to improved dictionaries and teaching materials. Cooccurrences of various linguistic items have been studied under a variety of names, e. g. collocation, colligation or collostruction. While there are well-understood and fully worked out statistical models for the analysis of cooccurrences of pairs of words, no such model exists for cooccurrences of larger linguistic structures. This situation is remedied by the current work. Building on the well-understood 2 × 2 contingency tables and a graph-based representation of linguistic structures, we develop the generalized cooccurrence model, an explicit formal model for the statistical analysis of cooccurrences of arbitrary linguistic structures. Existing methods for the analysis of two-word cooccurrences and for collostructional analysis are shown to be simply special cases of the generalized cooccurrence model.Die Kookkurrenzforschung, also die Analyse des gemeinsamen Auftretens von sprachlichen Einheiten, hat unser Bild von Sprache maßgeblich beeinflusst. Sie hat nicht nur wesentlich zur der Erkenntnis beigetragen, dass „Halbfertigprodukte der Sprache“ (Hausmann, 1984: 398) und einzelwortspezifisches Wissen zentrale Elemente der Funktionsweise von Sprache sind, sondern hat auch zu besseren Wörterbüchern und Lernmaterialien geführt. Die Kookkurrenz von sprachlichen Einheiten wurde mit verschiedenen Ansätzen und unter verschiedenen Bezeichnungen wie Kollokation, Kolligation oder Kollostruktion erforscht. Während es für die Analyse von Zweiwortkookkurrenzen wohlverstandene und vollständig ausgearbeitete statistische Modelle gibt, fehlen solche Modelle für Kookkurrenzen von größeren sprachlichen Strukturen. Diese Lücke wird durch die vorliegende Arbeit geschlossen. Aufbauend auf den etablierten Vierfeldertafeln und einer graphbasierten Repräsentation sprachlicher Strukturen wird das verallgemeinerte Kookkurrenzmodell entwickelt, ein explizites formales Modell für die statistische Analyse von Kookkurrenzen beliebiger sprachlicher Strukturen. Es wird gezeigt, dass existierende Methoden zur Analyse von Zweiwortkookkurrenzen und zur Kollostruktionsanalyse lediglich Spezialfälle des verallgemeinerten Kookkurrenzmodells sind

SentiKLUE: Updating a Polarity Classifier in 48 Hours

Author: Besim Kabashi
Friedrich-alexander-universität Erlangen-nürnberg
Paul Greiner
Stefan Evert
Thomas Proisl
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2014
Field of study

SentiKLUE is an update of the KLUE po-larity classifier – which achieved good and robust results in SemEval-2013 with a sim-ple feature set – implemented in 48 hours.

CiteSeerX

Crossref

"Delta" in der stilometrischen Autorschaftsattribution

Author: Büttner Andreas
Dimpel Friedrich Michael
Evert Stefan
Jannidis Fotis
Pielström Steffen
Proisl Thomas
Reger Isabella
Schöch Christof
Vitt Thorsten
Publication venue
Publication date: 01/12/2017
Field of study

Der Artikel stellt aktuelle stilometrische Studien im Delta-Kontext vor. Diskutiert wird, warum die Verwendung des Kosinus-Abstands zu einer Verbesserung der Erfolgsquote führt; durch Experimente zur Vektornormalisierung gelingt es, die Funktionsweise von Delta besser zu verstehen. Anhand von mittelhochdeutschen Texten wird gezeigt, dass auch metrische Eigenschaften zur Autorschaftsattribution eingesetzt werden können. Zudem wird untersucht, inwieweit die mittelalterliche, nicht-normierte Schreibung die Erfolgsquote von Delta beeinflusst. Am Beispiel von arabisch-lateinischen Übersetzungen wird geprüft, inwieweit eine selektive Merkmalseliminierung dazu beitragen kann, das Übersetzersignal vom Genresignal zu isolieren.In this article, we present current stylometric studies on Delta. (1) We discuss why the use of cosine similarity improves the rate of success; our experiments on vector normalization lead to a better understanding of how Delta works. (2) Based on a corpus of Middle High German texts, we show that metrical properties can also be used for authorship attribution. The degree to which Delta is influenced by non-normalized medieval spellings is also investigated. (3) Using a corpus of Arabic-Latin translations, we explore how selective feature elimination can be used to separate the translator signal from the genre signal

Directory of Open Access Journals

Hochschulschriftenserver - Universität Frankfurt am Main